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Abstract 

Two general algorithms based on opportunity costs are given for approximating a revenue- 
maximizing set of bids an auctioneer should accept, in a combinatorial auction in which each 
bidder offers a price for some subset of the available goods and the auctioneer can only accept non- 
intersecting bids. Since this problem is difficult even to approximate in general, the algorithms 
are most useful when the bids are restricted to be connected node subsets of an underlying object 
graph that represents which objects are relevant to each other. The approximation ratios of 
the algorithms depend on structural properties of this graph and are small constants for many 
interesting families of object graphs. The running times of the algorithms are linear in the size of 
the bid graph, which describes the conflicts between bids. Extensions of the algorithms allow for 
efficient processing of additional constraints, such as budget constraints that associate bids with 
particular bidders and limit how many bids from a particular bidder can be accepted. 



1 Introduction 



Auctions are arguably the simplest and most popular means of price determination for multilateral 
trading without intermediary market makers |], 25|, 3S|] . This paper considers the setting where 
there are (1) a group of competing bidders who bid to possess the auction objects and (2) an auctioneer 
who determines which bidders win which objects. 

For the case of allocating a single object to one of many bidders, there is a wealth of literature 
on the following four widely used forms of auction [18,25], 26 1. In an English auction or ascending 
bid auction, the price of an object is successively raised until only one bidder remains and wins 
the object. In a Dutch auction, which is the converse of an English auction, an initial high price 
is subsequently lowered until a bidder accepts the current price. In a first-price sealed-bid auction, 
potential buyers submit sealed bids for an object. The highest bidder is awarded the object and 
pays the amount of her bid. In a second-price sealed-bid auction, the highest bidder wins the object 
but pays a price equal to the second-highest bid. In all these forms of auction, the auctioneer can 
determine the winning bid in time linear in the number of bids in a straightforward manner. 

For the case of allocating multiple objects to multiple bidders [^,|l^,[l7|,^0, 21,27], combinatorial 
auctions are perhaps the most important form of auctions in the Internet Age, where bidders are 
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increasingly software agents. Oftentimes a bid by an agent is a subset of the auction objects, and 
the agent needs the entire subset to complete a task. Different bids may share the same object, 
but the winning bids must not share any object [p4| . Combinatorial auctions were first proposed 
by Rassenti et al. [pGfl as one-round mechanisms for airport time slot allocation. Banks et al. ||, 
DeMartini et al. |10|| , and Parkes and Ungar |2^] formulated multiple-round mechanisms. It is in 
general ./VP-hard for the auctioneer to determine a set of winning bids of a combinatorial auction 
which maximizes the revenue of the auction. To address this computational difficulty, Rothkopf 
et al. [^] placed constraints on permissible bids. Lehmann et al. [^] and Fujishima et al. [11] 
considered approximation algorithms. Sandholm and Suri [34] designed anytime algorithms, which 
return a sequence of monotonically improving solutions that eventually converges to optimal. 

In this paper, we propose a general framework to exploit topological structures of the bids to 
determine the winning bids with a provably good approximation ratio in linear time. The following 
discussion uses the sale of a car as a light-hearted example to explain our computational problems 
and key concepts. 

Imagine that we are in the business of auctioning used cars. If we insist on selling each car as a 
unit, we can sell each car to the highest bidder. If we are willing to sell parts of the car, we can still 
sell each part to the highest bidder. But suppose that some bidders are only interested in buying 
several parts at once: Alice may not want to buy a tire unless she can get the wheel that goes with 
it, while Bob might only be interested in both rear wheels and the axle between them. How do we 
decide which of a set of conflicting bids to accept? 

We will assume that our only goal is to maximize our total revenue. Then we can express this 
problem as a simple combinatorial optimization problem. We have some universe O of objects, and 
our buyers supply us with a set A of bids. The i-th bid consists of a subset A{ of O and a price pi 
that the buyer is willing to pay for all of the objects in Si. We would like to choose a collection of 
bids B C A that yields the best possible total price while being consistent, in the sense that no two 
sets Ai and Aj in B overlap. 

As the auctioneer, we can construct a bid graph G whose nodes are the bids and which has 
an edge between any two bids that share an object. Then, a set of consistent bids is simply an 
independent set in G, i.e., a set of nodes no two of which are connected by an edge. Each node is 
given a weight equal to the value of the bid it represents. 

Sadly, this means that the problem of finding the most valuable consistent set of bids is a thinly- 
disguised version of the maximum weight independent set problem, which is not only AP-hard, but 
cannot be approximated to within a ratio 0(n 1_<E ) for an n-node graph unless P = NP [|1J.[] Even 
for the simplest case when all node weights are one, the maximum weight independent set problem 
is AP-hard even when every vertex has degree at most d for any d > 3, and in fact cannot be 
approximated within a ratio of d £ for some e > unless P = NP Q . The best known algorithm (for 
arbitrary d) achieves an approximation within a factor of 0(d/ log log d) [jOf . As a result, it seems 
hopeless if we model our combinatorial auction problem as an independent set problem unless we 
exploit the topological structure of the underlying bid graph. 

Using ideas from the interval selection algorithm of Berman and DasGupta [[?J, we describe in 
Section [2] a linear-time improvement of the greedy algorithm, called the opportunity cost algorithm, 
for approximating maximum weight independent sets in ordered graphs.^ We then describe a similar 
algorithm called the local ratio opportunity cost algorithm, based on ideas from the resource allocation 

The fact that the bid graph is defined by the intersections of a collection of sets does not by itself help; any graph 
can be defined in this way. 

2 These are graphs in which the nodes have been assigned an order; as we will see in Section 3.5, the choice of order 
for a given bid graph can have a large effect on how good an approximation we can get. 
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algorithms of Bar-Noy et al. 0]. Both algorithms produce the same output, but the first has a more 
iterative structure and is easier to implement while the second has a more recursive structure and is 
easier to analyze. 

These opportunity cost algorithms distinguish themselves from the straightforward greedy algo- 
rithm by taking into account the cost of excluding previously considered neighbors of a chosen node. 
Since this accounting requires propagating information only between neighbors, it increases the run- 
ning time by at most a small constant factor, and yet in many cases produces a great improvement 
in the approximation ratio. The quality of the approximation depends on the local structure of the 
ordered input graph G. For each node v in G, we examine all of its successors (adjacent nodes that 
appear later in the ordering). The maximum size of any independent set among v and its successors 
is called the directed local independence number at v; we will write it as (3{v). The maximum value 
of (3{v) over all nodes in the graph will be written as /3(G) ,0 and is the directed local independence 
number of G. Our algorithms approximate a maximum weight independent set to within a factor of 
(3. By comparison, the greedy algorithm approximates a maximum weight independent set within a 
ratio of the maximum size of any independent subset of both the predecessors and the successors of 
any node, which in general can be much larger than (3 (see Section |J). 

These new approximation results are useful only if we can exhibit interesting classes of graphs 
for which (3 is small. Graphs with (3 = 1 have been extensively studied in the graph theory litera- 
ture; these are known as chordal graphs, and are precisely those graphs that can be represented as 
intersection graphs of subtrees of a forest, a class that includes both trees and interval graphs (more 
details are given in Section jO|). We give additional results showing how to compute upper bounds 



on j3 for more general classes of graphs in Sections 3.2 and 3.3. 

Among these tools for bounding 0, one of particular interest to our hypothetical combinatorial 
auctioneer is the following generalization of the fact that intersection graphs of subtrees have (3 equal 
to one. Suppose that we have an object graph whose nodes are objects and in which an edge exists 
between any two objects that are relevant to each other in some way. (In the car example, there 
might be an edge between a wheel and its axle but not between a wheel and the hood ornament.) We 
demand that the objects in each bid be germane in the sense that they must form a connected node 
subset of the object graph. For many sparse object graphs, the intersection graph of all connected 
sets of vertices can be ordered so that a later set intersects an earlier set only if it intersects a "frontier 
set" that may be much smaller than the earlier set. It is immediate that (3 for the intersection graph 
is bounded by the size of the largest frontier set (more details are given in Lemma ^) . Examples of 
such graphs are those of low treewidth (Theorem |{|) and planar graphs (Corollary |l0|). 

In Section |I] we show how to handle more complex constraints on acceptable sets of bids. We 
investigate scenarios where bids are grouped by bidder, and that each bidder is limited to some 
maximum number of winning bids (an unweighted budget constraint), or some maximum total cost 
of winning bids (a weighted budget constraint). By charging later bids an approximate opportunity 
cost for earlier bids in the same budget groups, we can solve these problems approximately with ratio 
(3 + 1 with unweighted constraints and 2(3 + 3 for weighted constraints. The results for unweighted 
budget constraints can be further generalized for more complicated constraints. 

Finally, in Section |H| we discuss some open problems suggested by the current work. 



3 Or simply f3 when G is clear from the context. 
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2 Simple combinatorial auctions 



In this section, we describe our algorithms for approximating the maximum weight independent 
set problem, the opportunity cost algorithm and the local ratio opportunity cost algorithm . Both 
algorithms return the same approximation. 

2.1 The opportunity cost algorithm 

We will write u — > v if uv £ E and call u a predecessor of v and v a successor of u. The set of all 
predecessors of u will be written as 5~(u) and the set of all successors as 8 + (u). 

Given a directed acyclic graph Go = (Vo,Eo) with weights weight (v) for each v in V, the oppor- 
tunity cost algorithm, Opcost, proceeds in two stages: 

OC1 Traversing the nodes according to the topological order of Go, compute a value value(u) for 
each node u. This value represents an estimate of the gain we expect by including u in the 
independent set; it is computed by taking the weight of u and subtracting off an opportunity 
cost consisting of the values of earlier positive-value nodes that conflict with u. Formally, let 



value(u) = weight(u) — max(0, value(u)). (1) 



OC2 Processing the nodes in reverse topological order, add any node with non-negative value to the 
desired independent set B and discard its predecessors. Formally, let 

select (u) = [value (u) > 0] A \/v G 5 + {u) : -> select (v). (2) 
The output of the algorithm is the set B defined as all u for which select (u) is true. This set B 



is clearly independent. In Section 2.3, we examine how close B is to optimal. 



2.2 The local ratio opportunity cost algorithm 

The local ratio technique can be used to recursively find approximate solutions to optimization 
problems over vectors in M n , subject to a set of feasibility constraints. It was originally developed 
by Bar- Yehuda and Even [||, and later extended by Bafna et al. Q, Bar- Yehuda ||, and Bar-Noy 
et al. @. 

Let w € M n be a weight vector. Let F be a set of feasibility constraints. A vector x £ M n is a 
feasible solution to a given problem (F, w) if it satisfies all the constraints in F. The w-weight of a 
feasible solution x is defined to be the dot-product w ■ x; for r > 1, x is an r- approximation with 
respect to (F,w) if r • w ■ x > w ■ x*, where x* is a feasible solution maximizing the it;- weight. An 
algorithm is said to have an approximation ratio of r if it always returns an r-approximate solution. 

Lemma 1 (Local Ratio Lemma [^]) Let F be a set of feasibility constraints. Let w, w\ and W2 

be weight vectors such that w = w\ +u>2- If % is an r- approximation with respect to (F,wi) and 
(F,W2), then x is an r- approximation with respect to (F,w). 

We now describe the local ratio opportunity cost algorithm, LR-Opcost. Given a directed 
acyclic graph Go = (Vq,Eq) with weights weight (v) for each v E Vq, we pass (Go, weight (•)) to the 
following recursive procedure. This procedure takes as input a graph G and a weight function w and 
proceeds as follows: 
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LR1 Delete all nodes in G with non-positive weight. Let this new graph be 
LR2 If G2 has no nodes, return the empty set. 

LR3 Otherwise, select a node u with no predecessors in G2, and decompose the weight function w 
as w = w± + W2, where 



and u>2 = w — w\. 

LR4 Solve the problem recursively using (G^,^) as input. Let B2 be the approximation to a 
maximum weight independent set returned by this recursive call. 

LR5 If £>2 U {u} is an independent set, return B = B2 U {u}. Otherwise, return B = B2- 

Theorem 2 Opcost and LR-Opcost return the same approximation to a maximum weight inde- 
pendent set. 

Proof: Consider a recursive call C of LR-Opcost. Let u be the node that is selected to be 
processed in step LR3. All of u's predecessors in the original graph Go have either been processed 
in a previous step LR3 or deleted in some step LR1. Therefore, the current weight of u, w(u), as 
seen by the recursive call C, is just value(u), as defined in step OC1 of Opcost. Furthermore, we 
add node u to our independent set in step OC2 if and only if we add u to our independent set in 
step LR5. I 

2.3 Approximation ratios 

Theorem 3 Opcost and LR-Opcost return a (5(G) -approximation to a maximum weight inde- 
pendent set. Furthermore, there exist weights for which this bound is tight. 

Proof: We will prove the result for LR-Opcost. The full result follows from Theorem ||. 
Clearly, the returned set of nodes B is an independent set. By Lemma |], we need only show that B 
is a /3-approximation with respect to w\ and W2- We will prove this by induction on the recursion. 
The base case of the recursion is trivial, since there are no positive weight nodes. 

For the inductive step, assume that B2 is a (3- approximation with respect to W2- Then B is also 
a /3-approximation with respect to W2 since 102 (u) = and B C B2 U {u}. 

To show that B is a /3-approximation with respect to w±, we will derive an upper bound (5w(u) on 
the maximum u?i-weight independent set and a lower bound w(u) on the ui-weight of any u-maximal 
independent set of nodes. A u-maximal independent set of nodes either contains u or adding u to it 
violates the property that it is an independent set. Our w\ performance bound is (3w(u)/w(u) = j3. 
Note that only u and its successor nodes will have a nonzero contribution to W\ -weight. 

The total weight of a maximum u>i-weight independent set is at most f3(u)w(u) < f3(G)w(u) = 
(3w(u). The total weight of any u-maximal independent set is at least w(u), since any such set 
contains at least one element of u U S + (u), and all such nodes are assigned weight w(u). Since the 
algorithm always chooses a u-maximal set, its W\ performance bound is /3. 

To show the bound is tight, pick some v that maximizes (3(v), and assign it weight 1 and all of 
its successors weight 1 — e, where e > 0. Let every other node in G have weight 0. When we run 
Opcost, the value of v will be 1, the value of each of its successors will be — e, and the value of any 
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other node is irrelevant because it has zero weight. Thus Opcost returns a set of total weight 1 but 
the maximum weight independent set has total weight at least (3{u) • (1 — e). I 

2.4 Running time 

Theorem 4 The running times of both Opcost and LR-Opcost are linear in the size of the input 
graph Gq. 

Proof: Opcost computes value(f) for each node v in time proportional to its indegree, 
and computes select (v) for each node in time proportional to its outdegree, for a total time of 
0(| Vo| + l-Ebl)- In the case of LR-Opcost, a recursive call is made at most once for each node in 
the graph, and defining wi and W2 in each call takes time proportional to the node's outdegree, for 
a total running time of 0(| Vq | + |-Eb|). I 



3 Properties of (3 

For any v, (3(v) is at most the larger of 1 or the outdegree of v. Thus, (3(G) is at most the larger of 
1 or the maximum degree of G. In many cases we can use the structure of G to get a much better 
bound. 

3.1 Graphs with (3 = 1 

Graphs with orientations for which (3 = 1 can be characterized completely. These are the chordal 
graphs, also known as triangulated graphs or rigid circuit graphs. The defining property of a chordal 
graph is that no cycle of length 4 or more appears as an induced subgraph. A succinct discussion 
of these graphs, including a variety of characterizations as well as several examples of interesting 
families of chordal graphs, can be found in El pp. 280-281]. For our purposes the most useful of 



these characterizations are stated in the following lemma: 

Lemma 5 Let G be an undirected graph. Then the following properties of G are equivalent: 

1. G is chordal. 

2. G is the intersection graph of subtrees of a forest. 

3. G has an ordering G' for which the successors of any node form a clique. Such an ordering is 
called a perfect elimination ordering. Restated in terms of (3, G has an ordering G' for which 
P(G') = 1. 

Proof: See @, pp. 280-281]. I 

Chordal graphs can be recognized and ordered using a specialized version of breadth-first search 



in 0(| V] + \E\) time as shown by Rose et al. [32], and their maximum cardinality independent sets 



can be computed in 0(|V| + \E\) time as shown by Gavril |13|]. Gavril's algorithm is essentially 
the same as step OC1 of the opportunity cost algorithm; it chooses all nodes with positive value 
and works because the sets {v : u — ► v } for each u in the independent set form a clique covering. 
However, this algorithm does not deal with weights. 

Special cases of graphs with = 1 include trees, interval graphs, and disjoint unions of cliques. 
The last are particularly nice: 
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Lemma 6 Let G be a disjoint union of cliques. Then every orientation G' of G has /3(G') = 1. 
Proof: For each u in G', 5 + (u) is a clique. I 

3.2 Graphs with larger (3 values 

For general graphs, we cannot compute (3 even approximately. However, we can bound the (3 values 
of many graphs using the tools in this section. 

Lemma 7 Let G be a directed graph. 

1. IfG = GiUG 2 , then [3(G) < (3(d) + (3(G 2 ). 

2. If G is a node-induced subgraph of H , then (3(G) < (3(H). 

Proof: Let u be a node of G. Let 5 + (u), df(u), <5^(u), and 5^(u) be the set of all successors 
of u in G, Gi, and G 2 , respectively. Let A be any independent subset of 5 + (u). Then 

1. \A\ <\AD <$+(«) I + \A n 8+(u)\ < I3(G{) + (3(G 2 ), and 

2. A is an independent subset of 5]j(u), implying \A\ < (3(H). 

I 

Lemma 8 Let G be the intersection graph of a set system A whose union is O. Let G be ordered 
by an ordering < such that for each A € A there exists a "frontier set" Sa Q U of size at most k, 
so that if A < B and An B ^ 0, then S A r\ B / 0. Then (3(G) < k. (Note that S A need not be 
contained in A.) 

Proof: Let B\, . . . ,Bi be some independent set of successors of A. Under the conditions of the 
lemma each Bi intersects Sa- But since the Bi do not themselves intersect, each must intersect 5,4 
in a distinct element. Thus there are at most k of them. 1 

The converse of the lemma does not hold. Instead, its proof shows that the clique covering 
number x of S + (A) (defined as the minimum size of any set of cliques whose union is 8 + (A)) is 
at most k, since the set of all B that intersect Sa at any particular element form a clique. Note 
that any directed acyclic graph in which x(b~ + ( v )) is bounded can be represented as an intersection 
graph with small frontier sets as in Lemma HJ^] in general the independence number of 5 + (v) may 
be smaller than the clique covering number. 

When A consists of connected node subsets of some graph H, we can obtain good orderings of 
the intersection graph G of A by exploiting the structure of H. 

We start by reviewing the definition of treewidth. A tree decomposition of an undirected graph 
H = (V, E) consists of a tree T and a family of sets V = {Vt} where t ranges over nodes of T, 
satisfying the following three properties: 

1- Ute T Vt = V. 

2. For every edge uv in E, there is some Vt that contains both u and v. 

4 The trick is to add a new common element to all members of each clique, and let Sa be the set of all such new 
elements for the cliques that cover 5 + (A). 
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3. If t2 lies on the unique path from t\ to £3 in T, then Vt 1 H Vt 3 C Vi 2 . 

The width of a tree decomposition (T, V) is max \Vt\ — 1. The treewidth tw(H) of a graph is 
the smallest width of any tree decomposition of . 

Theorem 9 If G is an intersection graph of connected node subsets A of some graph H with 
treewidth k, then there is an orientation G' of G with (3{G') < k + 1. Given A = {A{\ and a tree de- 
composition (T, V = {Vt}) of H , this orientation can be computed in time 0(%2i \ Ai \ + \T\ + J2 t \ Vt\), 
which is linear in the size of the input. 

Proof: Let (T, V) be a tree decomposition of H with width k. We will use this tree decompo- 
sition to construct an ordering of the connected node subsets of H, with the property that if A < B 
then either A n B = or B intersects some frontier set Sa with at most k + 1 elements. The full 
result then follows from Lemma ||. 

Choose an arbitrary root r for T, and let t\ > ti if t\ is an ancestor of t<i in the resulting rooted 
tree. Extend the resulting partial order to an arbitrary linear order. For each connected node subset 
A of H, let tj\ be the greatest node in T for which Vt A intersects A. Given two connected node 
subsets A and B of H, let A < B if tjy < ts and extend the resulting partial order to any linear 
order. 

Ordering T can be done in 0(|T|) time using depth first search. We can then compute and the 
maximum node in T containing each node of H in time 0(^2 t Vt) by considering each Vt in order. 
The final step of ordering the Ai in the given set system S takes 0(^ li \ Ai\) time, since we must 
examine each element of each Ai to find the maximum one. The total running time is thus linear in 
the size of the input. 

Now suppose A < B in this ordering. We will show that any such B intersects Vt A , and thus 
that Vt A is our desired frontier set Sa- There are two cases. 
If t a = ts , we are done. 



The case £4 < ts is more complicated. We will make heavy use of a lemma from [31|, which 
concern the effect of removing some node t from T. Their Lemma 2.3 implies that if not 
in Vt, then either x and x' are separated in H by Vt or x and x' are in the same branch (connected 
component) of T — t. 

Let p be the parent of tj± (which exists because Ia is not the greatest element in the tree ordering). 
We have Af] V p = since p > tA- Since A is a connected set, it cannot be separated without removing 
any of its nodes; thus by Lemma 2.3 every element of A is in the same branch olT—p, which consists 
precisely of the subtree of T rooted at tA- 

Now B contains at least one node x in the vertex set of an element of the subtree rooted at tA, 
and at least one node x' in Vt B , which is not in this subtree because ts > tA- So by Lemma 2.3 
of [|3lJl , either one of x, x' is in Vt A or B is separated by V% A . In the latter case B intersects Vt A since 
B is also connected. I 

Applying Theorem |9| to planar graphs gives: 

Corollary 10 If G is the intersection graph of a family A of connected node subsets of a planar 
graph H with n nodes, then there is an orientation G' of G with f3(G') = 0(y/n). Given H, a data 
structure of size 0(n) can be precomputed in time 0(n log n) that allows this orientation G' to be 
computed for any A = {Ai} in time 0(%2i I^M)- 



Proof: Reed |3(J gives a recursive 0(n log n) algorithm for computing tree decompositions of 
constant-treewidth graphs based on a linear time algorithm for finding approximate separators for 
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small node subsets. Replacing this separator-finding subroutine with the linear time algorithm of 
Lipton and Tarjan |23| gives an 0(n log ri) time algorithm for computing a tree decomposition of a 
planar graph. Since each separator has size at most k = 0(y/n), the resulting tree decomposition 



has width at most 4k = 0{y/n) by Theorem 1 of [ 30 1 . 



Since all we need to compute a good ordering of A is the ordering of the n nodes, we can compute 
this ordering as described in the proof of Theorem || and represent it in O(n) space by assigning 
each node an index in the range 1 to n. Ordering A then takes linear time as described in the proof 
of Theorem ^. I 

3.3 Examples 



Applying the results of Sections 3.1 and 3.2 gives: 



A linear-time algorithm for finding a maximum weight independent set of an interval graph, 
since f3{G) = 1 by Lemma [|, and since chordal graphs can be recognized and ordered in linear 



time using the work of Rose et al. [ 32 1 



While the maximum independent set problem is easily solved for this case (for example, by 
using the linear time interval graph recognition algorithm of Hsu and Ma fl9| followed by a 
simple application of dynamic programming) this is an example of how our general method 
yields good algorithms as special cases. 

2. As another special case, a 2-approximation algorithm for interval selection of Berman and 
DasGupta @. Here intervals are partitioned into groups and we must choose non-overlapping 
intervals with at most one per group. The bid graph G is of the form G\ U G2 where G\ is 
an interval graph and G2 is a disjoint union of cliques, one for each group. Thus /3(G) = 2 by 
Lemmas [|, ^, and |7| 

3. A 3- approximation algorithm for "double auction" interval selection where each interval has 
both a seller and a buyer, and at most one interval per seller or buyer may be selected. This 
is the same as the previous case except the graph is now G\ U G2 U G3 where G2 and G3 are 
both disjoint unions of cliques. 

4. In general, a mechanism for taking any bid graph with (3 = k and adding up to m such unique- 
selection constraints to get a (k + m)-approximation algorithm by repeated applications of 
Lemmas [6] and |7| So for example we get a 3- approximation algorithm for maximum weight 
three-dimensional matching and a 4-approximation algorithm for auctioning off tracts of unde- 
veloped land spanning intervals where each tract must be acceptable to a seller who provides 
it, a builder who will develop it, and a buyer who will ultimately purchase both the land and 
the buildings developed on it. 

5. An algorithm to ^'-approximate a maximum weight independent set of any subgraph of a k- 
dimensional rectangular grid. Orient each edge to leave the point whose coordinates have a 
smaller sum, giving (3 < k. 

6. A linear-time algorithm for 2-approximating a maximum weight independent set of the in- 
tersection graph of intervals on a cycle. This follows from Lemma |8|: order connected node 
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subsets by inclusion, extend to a linear order -<, and observe that if A -< B and A intersects 
B then B intersects one of A : s two endpoints.^] 

7. An algorithm for intersection graphs of bounded-height rectangles in a discrete 2D grid. Order 
the rectangles by their largest x-coordinate, and make the rightmost grid points of each rect- 
angle be its frontier set in the sense of Lemma |§[ If each rectangle is at most h tall, there are at 
most h grid points in each frontier. This generalizes in the obvious way to higher dimensions 
given bounds on all but one of the coordinates, in which case the approximation ratio becomes 
the product of the bounds. 

3.4 Hardness of computing (3 

The difficulty of even approximating the independence number of a graph extends to the directed 
local independence number. 

Theorem 11 Any algorithm that can approximate [3(G) for an n-node directed acyclic graph G 
with a ratio of f(n) can be used to approximate the size a(H) of a maximum independent set of 
an undirected n-node graph H with ratio f(n + 1). Thus by Hdstad's bound on approximating a 
maximum clique JJ^/ ; we cannot approximate (3 by 0(n 1_e ) unless P = NP. 

Proof: Given an undirected n-node graph H, construct an (n + l)-node directed acyclic graph 
G by (a) directing the edges of H in any consistent order, and (b) adding a new source node s to H 
with edges from s to every node in H. 

Let / be an independent set in H. Then every node in / is a successor of s in G, and furthermore 
these nodes are all independent. It follows that (3(G) > (3(s) > a(H). 

Conversely, if /' is an independent set of successors of some node v in H, it cannot contain s 
(since s is not a successor of any node), and thus I' is also an independent set in H. So we have 
a(H) > /3(G). I 



3.5 Effects of node ordering 

The performance of the opportunity cost algorithm is strongly sensitive to the order in which the 
nodes are processed, as this affects the value of (3(u) for each node u. For many of the examples 



given in the Section 3.3, a good ordering is provided by the structure of the problem. But what 



happens in a general graph? 

Theorem 12 For any graph G with given weights, there exists an orientation G' of G for which 
both Opcost and LR-Opcost output a maximum independent set of G. 

Proof: Let A be any independent set in G. Choose the ordering so that all nodes in A 
precede all nodes not in A. Then for any u £ A, u has no predecessors in the oriented graph and 
value (u) = weight (u). 

Let A' be the independent set computed by the algorithm. If u is in A but not A' , it must have 
a successor v in A' — A with non-negative value. Since the value of each v is its weight less the 
weight of all its neighbors in A, the total weight of all elements of A' — A must exceed the total 



5 One can do better by breaking the cycle to reduce it to a standard interval graph problem (see, for example, the 
approach taken by jij ) , but the 2-approximation shows how one can still do reasonably well with our general algorithms 
Opcost and LR-Opcost. 
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weight of all elements in A — A' , and we have weight(^4') = weight(A' — A) + weight(^4' n A) > 
weight (A - A') + weight (A' n A) = weight (A). I 

In a sense what Theorem |l2| shows is that finding a good ordering of a general graph is equivalent 
to solving the maximum weight independent set problem. This is not surprising since evaluating 
(3(u) for even a single node u requires solving this problem. It follows that to get small approximation 
ratios we really do need to exploit some special property of the given graph. 

In the other direction, we can show that there exist orderings that are not very good: 

Theorem 13 If all nodes in a graph G have distinct weights, orienting G in order of decreasing 
weight causes Opcost and LR-Opcost to return the same independent set as the greedy algorithm. 

Proof: We will prove the result for Opcost; by Theorem ^ the same result holds for LR-Op- 
COST. 

Let 7T order the nodes in order of decreasing weight. Let us show by induction on tt that if the 
greedy algorithm chooses a node v, then value(u) = weight (v); but if the greedy algorithm does 
not choose v, then value(w) < 0. Suppose we are processing some node v and that this induction 
hypothesis holds for all nodes previously processed. If the greedy algorithm picks v, then all u's 
predecessors were not chosen and have negative value, and value(u) = weight (v). If the greedy 
algorithm does not pick v, it is because it chose some u — ► v; now value(u) < weight (v) — value(u) = 
weight (v) — weight (u) < 0. 

Since the only nodes with non-negative weights are those chosen by the greedy algorithm, Op- 
COST selects them as its output. I 

4 Auctions with budget constraints 

Consider the following bidding scenarios: 

1. A bidder whose car has broken down wants to buy either a new engine, a new car, or an 
umbrella and a taxi ride home, but doesn't particularly care which. However, she has no 
interest in winning more than one of these bids. 

2. Another bidder wants to buy at most three 1968 Volkswagen Beetle hood ornaments, but she 
would like to bid on all that are available so as not to miss any. 

3. Yet another bidder has only $100 in cash, but would like to place multiple bids totaling more 
than $100, with the understanding that she can only win bids up to her budget. 

All of these are examples of budget constraints, in which bids in some group consume a common 
scarce resource. We would like to extend our algorithms to handle such constraints, which are natural 
in real- world bidding situations. 

The first scenario is an example of a 1-of-n constraint, where at most one of a set of n bids can 
be accepted. This special case can be handled by modifying G by forming a clique out of all bids 
in each set Sf, under the assumption that the S{ are disjoint, this increases (3 by at most 1 (using 
Lemmas ^ and [?]). The second scenario depicts a more general k-of-n constraint. Such constraints 
are handled by extending our algorithms to account for the possible revenue loss from bids that 
cannot be selected because the budget constraint has been exceeded. Again, the approximation 
ratio rises by 1. We refer to both 1-of-n and k-oi-n constraints as unweighted budget constraints, as 
each bid consumes a single unit of the budget. 



11 



Weighted budget constraints, exemplified by the third scenario, are more complicated. With such 
constraints, we must ensure that the sum of the weights of accepted bids in some group S is at most 
some bound b. A complication arises because a maximal allowed set of bids might only fill half of 
a budget limit. With some additional modifications to our algorithms, we get a performance bound 
of 2/3 + 3. 

4.1 Unweighted budget constraints 

Suppose the bids are partitioned into groups Si,... ,S r and that no more than h L bids may be 
selected from Si, for 1 < i < r. For each bid u, let g(u) denote the index of the group to which u 
belongs and let S u = S g r u \ and k u = k g ^ . 

Unweighted-Opcost is an extension of Opcost to handle unweighted budget constraints. It 
has a similar two-step structure. 

In the first step, like OC1, we traverse the nodes in topological order and compute a value for 
each node. We must extend the definition of value for each node to account for the possible revenue 
loss from previously processed bids that may not be selected in the second step because of the budget 
constraint: 

value(n) = weight(n) — max(0, value(w)) — — • max(0, value(t> )), (3) 

v->u u veS u -{u},v<u 

where the notation v < u means that v has already been processed (before u). Note that the 
inclusion of u in the set of winning bids does not necessarily preclude previously processed bids in 
S u from also being selected — they may also be selected if the budget k u allows. The coefficient 
scales the opportunity cost to account for this fact. 

In the second step, like OC2, we traverse the bid graph in reverse topological order, selecting 
nodes of positive value whose addition to those already selected does not violate the independence 
or budget constraints. 

Unweighted-LR-Opcost solves the same problem using the local ratio technique. It follows 
the same structure as LR-Opcost. We begin by deleting all non-positive weight nodes from the 
graph. If any nodes remain, we select a node u with no predecessors, and decompose the weight 
function into w = Wi + u>2 ■ This time, the decomposition must account for bids that are in the same 
budget group. We define 

(w(u) if v € {u} U 6 + (u), 
T^w(u) tfveS u -{u}, 
otherwise, 

and recursively solve the problem using u>2 as the weight function. After the recursive call, we must 
decide if we should add u to the set of winning bids £>2- In LR-Opcost, we added u to B2 if and 
only if B2 U {u} was an independent set. In this algorithm, we must also ensure that the budget 
constraints are satisfied before adding u to 82- We say that a set of bids is feasible if they form an 
independent set and the budget constraints are satisfied. 

Theorem 14 Given a directed bid graph G, a partition of the nodes of G into nonempty subsets 
Si,... ,S r , and an unweighted budget constraint fej for each Si, 

1. Unweighted-Opcost and Unweighted-LR-Opcost return the same approximation to a 
revenue maximizing set of bids. 
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2. Unweighted-Opcost and Unweighted-LR-Opcost (/3(G) + 1)- approximate an optimal 
set of bids. 

3. Unweighted-Opcost and Unweighted-LR-Opcost run in time linear in the size of G. 

Proof: The proof that both algorithms return the same approximation is similar to the proof 
of Theorem ||. 

The proof of the approximation ratio follows the same structure as the proof of Theorem ||. We 
prove the result for Unweighted-LR-Opcost. By Lemma |], we need only show that the returned 
set of bids B is a (/? + l)-approximation with respect to W2 and w\. We do this using induction on 
the recursion. The fact that B is a (/? + l)-approximation with respect to W2 follows trivially from 
the inductive assumption. 

In the case of w\, we will derive an upper bound U on the maximum ^i-weight of a set of feasible 
bids and a lower bound L on the u>i-weight of any u-maximal set of bids. A n-maximal set of bids 
either contains u or adding u to it would violate the feasibility constraints. In the case of a set of 
feasible bids, its total wi-weight is at most (3{u)w(u) + y^w(u) < w(u){j3 + 1) = U, since the only 
nonzero contribution to i^i-weight comes from 5 + (u) and S u - In the case of a u-maximal set of 
bids, if u cannot be added to the set, then either (1) a successor of u is already in the set, in which 
case the total tui-weight is at least w(u), or (2) the budget constraint is exceeded, in which case the 
total t«i-weight is at least w(u). Therefore, the t^i-weight of these bids is at least w(u) and the W\ 
performance bound is 

VwW<P + X) = 
L w{u) 

The proof of the running time is similar to the proof of Theorem All of the steps that 
Unweighted-Opcost and Unweighted-LR-Opcost share with Opcost and LR-Opcost take 
linear time. Unweighted-Opcost adds the cost of computing the last term in (]3|). Storing 
J2 v eS- max (0, value(u)) in a variable Ag. for each Si allows this term to be computed in time 0(1) 
for each node, with an additional O(l) cost per node to update the appropriate Si- The same tech- 
nique allows budget constraints to be tested in O(l) time per node during the second step. Thus 
the additional time is linear. 

The corresponding modification to Unweighted-LR-Opcost similarly adds only linear time. 
Rather than updating the weight of each node v before each recursive call, we will compute the 
"current" weight of each node v as it is required, subtracting off the total weight Ag v of all previously- 
processed nodes in S v as in Unweighted-Opcost. I 



4.2 Overlapping unweighted constraints 



The analysis in Section |L1| assumes that the budget constraints partition the bids. For some appli- 
cations (e.g., bids involving matching up buyers with sellers), we may have overlapping constraints. 
Overlapping constraints may also be used to handle bids for identical items in limited supply, by 
grouping all bids asking for copies of the same item together. The algorithms described above can 
be generalized to handle overlapping constraints. 

Suppose we have a family of r sets of bids S = {Si, . . . , S r }, that each bid appears in at most t 
of these sets, and that at most ki bids may be accepted from set Si. 

In Overlapping-Unweighted-Opcost, when computing the value of a node u, we need to 
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account for the possible revenue loss from nodes in each set that u belongs to: 

value(u) = weight(n) — max(0, value (v )) — | -rr max(0, value )) J . 

v—*u l<i<r,ueSi \ v£S u ,v<u J 

The rest of the algorithm is the same as Unweighted-Opcost. 

In Overlapping-Unweighted-LR-Opcost, the only change from Unweighted-LR-Opcost 
is in the decomposition of the weight function. We decompose it as 



w\{v) = < 



w(u) if v £ {u} U 5 + (u), 

Yli<i<ruveSi 'F~ W ( U ) ^ there exist S{ containing both u and v, 
otherwise. 



Theorem 15 Given a directed bid graph G = (V, E), a family of nonempty node subsets S\, . . . ,S r , 
where each node appears in at most t of the Si, and an unweighted budget constraint ki for each Si, 

1. Overlapping-Unweighted-Opcost and Overlapping-Unweighted-LR-Opcost return 
the same approximation to a revenue maximizing set of bids. 

2. Overlapping-Unweighted-Opcost and Overlapping-Unweighted-LR-Opcost 
(/3(G) + t)- approximate an optimal set of bids. 

3. Overlapping-Unweighted-Opcost and Overlapping-Unweighted-LR-Opcost run in 
time 0{\V\t+\E\) 

Proof: Similar to the proof of Theorem [14|. The additional 0(|V|t) term comes from having 
to apply up to t budget constraints to each node; since ^ \ Si\ < \V\t, this term also covers the cost 
of reading the Si from the input and initializing the variables for each subset. I 



4.3 Weighted budget constraints 

Suppose that bids are partitioned into groups Si,... ,S r and that the total value of the winning 
bids from group i can be no more than 6j. For each bid u, let g(u) denote the index of the group to 
which u belongs and let S u = S g r u \ and b u = b g r u ) . 

This case is more complicated than the unweighted case. The difficulty arises when estimating a 
lower bound on the wi-weight of a u-maximal set of bids S. If u cannot be added to the set because 
the budget constraint will be exceeded, the w\ -weight of S can be as small as e, if Wi(u) = b u . 

We will describe changes required to LR-Opcost to handle this case. Corresponding changes 
can be made to Opcost. We will run variations of the algorithm twice, once for the heavy bids v 
with w(v) > ^b v and once for the light bids v with w(v) < ^b v . We then return the better of the 
two solutions. 

In Heavy- Weighted-LR-Opcost, we put an unweighted budget constraint of 1 on each bidder 
and run Unweighted-LR-Opcost. 

Lemma 16 Heavy- Weighted-LR-Opcost {(3 + 1)- approximates an optimal set of heavy bids. 
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Proof: Since each heavy bid consumes more than half a bidder's budget, each bidder can win 
at most one bid. This is just a simple unweighted budget constraint and can be solved as described 



in Section 4.1 for a performance bound of [3 + 1. I 



In Light-Weighted-LR-Opcost, when decomposing the weight function, we set 

w\{v) - 



w(u) if v G {u} U 5 + (u) 

^w(v)w(u) iiveS u -{u}, 
otherwise. 



Before adding u to the winning set of bids £>2, we must ensure that it does not conflict with other 
bids in £>2 and that the weighted budget constraint is not violated. The rest of the algorithm is 
identical to LR-Opcost. 

Lemma 17 Light- Weighted-LR-Opcost ((3 + 2) -approximates an optimal set of light bids. 

Proof: This proof uses the same structure and notation as the proof of Theorem [l4]. An upper 
bound U on the ^i-revenue of any feasible set of bids is w{u){(3 + 2). With regards to a u-maximal 
set of bids, if u cannot be added to the set because the budget constraint b u will be exceeded, the 
existing bids in the set must have weight at least b u /2, since w(u) <b u /2. A lower bound L on the 
w\ -revenue of any u-maximal set of bids is therefore w(u). The performance bound of this algorithm 
is j- = P + 2, as claimed. I 



Theorem 18 Given a directed bid graph G, a partition of the nodes of G into nonempty subsets 
Si, ■ ■ ■ ,S r , and a weighted budget constraint hi for each Si, 

1. Weighted-Opcost and Weighted-LR-Opcost return the same approximation to a revenue 
maximizing set of bids. 

2. Weighted-Opcost and Weighted-LR-Opcost (2(3(G) + 3) -approximate an optimal set of 
bids. 

3. Weighted-Opcost and Weighted-LR-Opcost run in time linear in the size of G. 

Proof: The sum of the optimal revenues for the heavy and light bids is at least equal to the 
optimum revenue among all bids. From Lemmas [l^ and |l7], the better of the two solutions will be 
within a factor of 2(3 + 3 of the optimum for the general problem. 

For the running time, observe that decomposing the bids into heavy and light bids takes linear 
time, that Heavy- Weighted-Opcost and Heavy- Weighted-LR-Opcost are equivalent to Un- 
WEIGHTED-Opcost and Unweighted-LR-Opcost and thus take linear time by Theorem |l^, and 
that Light- Weighted-Opcost and Light- Weighted-LR-Opcost can be made to run in linear 
time using techniques similar to those used for Unweighted-Opcost and Unweighted-LR-Op- 
COST. I 



5 Further Research 

This paper opens up several directions for further research. An immediate open problem is whether 
overlapping weighted budget constraints can be processed as efficiently as their unweighted counter- 



parts are processed in Theorem 15 
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It would be of importance to compare the performance of our algorithms and others in practice. 
The comparison could be conducted on simulations, but it would be more useful to analyze the 
performance on real auction data. 

As the examples of car sales and land sales demonstrate, topological structures exist in actual 
bids. Another good example is the FCC auction of airwaves in 1994 and 1995 [24], where each 
trading area is an auction object, the trading areas form a plane graph, and bidders prefer to 
acquire contiguous trading areas. It would be useful to examine past auctions to determine whether 
similar connectivity structures exist and how such structures affect the computational complexity of 
bidding strategies and winner determination. 
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